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PRODUCTION OF HXJMAN PROUROKINASE 

The present invention relates to a recombinant DNA 
method of producing non-glycosilated single-chain 
prourokinase (hereinafter referred to as proUK). More 
particularly, it relates to a method of producing 
non-glycosilated proUK which comprises recovering mRNA 
from an established cell line, preparing cDNA based on 
said mRNA, inserting the cDNA into a vector, 
introducing the resulting plasmid into a bacterial cell 
to thereby produce a trasforraant and recovering 
non-glycosilated proUK from said bacterial cell. 
The invention concerns also certain expression plasmids 
employed in the above method. 

Introduction 

The increasing knowledge of the molecular interactions 
that regulate physiological fibrinolysis has lead to 
important implications in the understanding of the 
mechanisms that dissolve blood clots, and in the 
development of new thrombolytic agents. 
In the human fibrinolytic system a proenzyme, 
plasminogen, can be activated to the active enzyme,' 
Plasmin, by several types of plasminogen activators 
(Collen, D. and Lijnen, H.R., CRC Critical Reviews in 
oncology /hematology, 4, n. 3, p. 249, 1986; Verstraete, 
M. and Collen, D. : Blood, 67, n. 6, p. 1529, 1986).' 
Plasmin is the major protease responsible for the 
degradation of the fibrin component of a blood clot 
(Rakoczi, I., Wiman, B. and Collen, D. : Biochim. 
Biophys. Acta, 540, p. 295, 1978; Robbins, K.C. 
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Sunmaria, L., Hsieh, B. ; and Shah, R.S.: j. Biol. Chem. 
242, p. 2333, 1967; Wiman, B,; Eur. J. Biochein, 76, p, 
129, 1977), 

However, plasmin can also exert its proteolytic effect 
on several plasma proteins among which the components 
of the coagulation pathway fibrinogen, factor V and 
VIII (Collen, D. and Lijnen, H.R. , CRC Critical Reviews 
in oncology/hematology, 4, n- 3, p. 249, 1986; 
Verstraete, M. and Collen, D. : Blood, 67, n. 6, p. 1529, 
1986; Wiman, B., Lijnen, H.R. and Collen, D. ; Biochim, 
Biophys. Acta, 579 . p. 142, 1979). 

Activation of plasminogen may occur at the systemic 
level, leading to circulating plasmin that is rapidly 
neutralized by alf a2-antiplasrain and thus not available 
for fibrinolysis (Collen, D. and Lijnen, H.R., CRC 
Critical Reviews . in oncology/heraatology, 4, n. 3,p, 
249, 1986; Verstraete, M. and Collen, D. : Blood, 67, n. 
6, p. 1529, 1986). 

When the alf a2-antiplasrain level is markedly reduced, 
plasmin is less rapidly neutralized and can exert its 
proteolytic effects not only on fibrin, but also on the 
blood coagulation proteins as described previously. 
Excessive lowering in the plasma concentrations of 
fibrinogen, factor V and VIII, together with the 
inhibitory effects exerted by some of the fibrinogen 
degradation products on the hemostatic process, on 
platelet aggregation and on fibrin polymerisation lead 
to hemostatic deficiency and subsequently to high 
bleeding risk (Latallo, Z.S. and Lopaciuk, s.; 
Throrabos. Diath. Haemouh., 56, p. 253, 1973; Totty, 
W.G., Gilula, L.A., Mc. Clennman, M, . Ahmed, P., and 
Sherman, L. Radiology, 143, p. 59, 1982). On the other 
hand, activation of plasminogen may occur at the fibrin 
level (fibrin-bound plasminogen activation) leading to 
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fibrin-bound plasmin (Collen, D. and Lijnen, H.R. , CRC 
Critical Reviews in oncology /hematology, 4, n. 3, p. 
249, 1986; Verstraete, M. and Collen, D. : Blood, 67, n. 
6,p. 1529, 1986) which is, instead, not affected by 
alfa2-antiplasmin and cannot induce systemic 
f ibrinogenolysis . 

Urokinase and streptokinase, the most commonly used 
plasminogen activators in conventional thrombolytic 
therapy in man,, have no specific activity for fibrin. 
Both compounds activate relatively indiscriminately 
either circulating or fibrin-bound plasminogen 
(Zamarron, C, Lijnen, H.R. , Van Hoef, B. , and Collen, 
D., Thromb. Haemostas. 52, p. 19, 1984; Samaraa, M. , and 
Kher, A. Sem. Hop. Paris. 61, n. 20. p. 1423, 1985). 
Therefore, the systemic haemostatic breakdown often 
encountered during treatment with streptokinase and 
urokinase and, consequently, the elevated bleeding risk 
have often hampered the widespread clinical use of 
these thrombolytic agents, despite their demonstrated 
clinical efficacy (Saraama, M. , and Kher, A. Sem. Hop. 
Paris, 61, n. 20. p. 1423, 1985; Maizel, A.S., and 
Bookstein , J. J . : Cardiovasc . Intervent . Radiol . , 9 , p . 
236, 1986; Bell, W.R. Thromb. Haemostas., 35, p. 57, 
1976; Acar, J., Vahanian, A., Michel, P.L., Slajna, M. , 
Cormier, B. and Roger, v.. Seminars in Thromb. and 
Haemost., l^, n. 2, p. 186, 1987; Gruppo Italian© per 
lo studio della Streptochinase nell»infarto miocardico 
CGISSI); Lancet, 1, p. 397, 1986). 

On the contrary, tissue-type plasminogen activator 
(t-PA) (Hoylaerts, M., Ryken, D.c, Lijnen, H.R. and 
Collen, D.J. Biol. Chem. , 257, n. 6, p. 2912, 1982), 
and more recently prourokinase (pro-UK) (Husain, S.S., 
and Gurewich, V., Arch. Biochem. Biophys. 220 p. 3i, 
1983), both natural proteins, were shown to be weak 
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activators of the circulating plasminogen and^ 
conversely, strong activators of the fibrin-bound 
plasminogen, without either systemic haemostatic 
breakdown or consumption of alf a2-antiplasmin and 
plasminogen, thus their clinical use may cause lesser 
bleeding risk. 

The fibrin-specific thrombolytic activity of t-PA has 
been explained by its ability to bind fibrin through 
specific lysine binding sites, located in the 
triple-disulfide-bonded "kringle domains" of the 
molecule • 

Consequently, fibrin-bound plasminogen could "be 
activated without significant haemostatic breakdown 
(Collen, D., and Lijnen, H.R.: Haemostasis; 1^, n. 3, 
p. 25, 1986). On the other hand, proUK (also 
denominated single chain urokinase type plasminogen 
activator, scu-PA) does not bind to fibrin, however it 
displays fibrin- specific thrombolytic activity without 
systemic hemostatic consumption (Pannell, R. and 
Gurewich, V,, Blood, 67, p. 1215, 1986; Gurewich V., 
and Pannell, ; Seminars in Thromb* and Haemost, , 13 , 
n, 2, p. 146, 1987; Lijnen, H.R. , Zamarron, Blaber, 
Winbler, M.E., and Collen, D. , J. Biol. Chem. 261, 
p. 1253, 1986). 

Recombinant t-PA was submitted to multicenter clinical 
trials in patients with acute myocardial infarctions 
and was shown to be significantly more effective than 
streptokinase in the recanalization of obstructed 
coronary arteries (The European Cooperative Study Group 
for Recombinant Tissue-type Plasminogen Activator; 
Lancet, 1 p. 842, 1985; Sheehan, F.H. Braunwald, E., 
Canner, P., Doodge, H.T., Gore, J., Van Natta P., 
Passamani, E.R. Williams, D.O. , Zaret, B.: Circulation, 
75, 4, p. 817, 1987). 
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Prourokinase is at present in advanced clinical trials 
and is thought to be, at least, as effective as t-PA in 
terms of thrombolytic activity and safety (Van de Werf 
P., Nobuhara, M. , and Collen, D. : Annals of Internal 
Medecine, 104, p. 345, 1986; Van de Werf F 
Vanhaecke, j. , De Geest, H. , Verstraete, M.' and 
Collen, D.: Circulation, 74, n. 5, p. 1066, 1986} 

Background of the invention 

urokinase- type plasminogen activators (u-PAs) are found 
in at least three different forms in human urine 
plasma and conditioned culture medium from a variety of 
cell lines. The first form to be characterized as u-PA 
consisted of a f ibrinolytically active polypeptide of 
410 amino acids, with an apparent moelcular weight of 
54000 daltons, containing two disulf ide-linked chains 
(Gunzler, W.A. , Steffens, G. J. , Getting, F. , Kim, SM.A. 
Frankus, E., and Flohe, l.: Hoppe - Seyler's z 
Physiol. Chem. 363, p. 1155, 1982). 

The A-chain or light chain contains 157 amino acids and 
one triple disulf ide-bonded "kringle" structure. This 
chain also contains a receptor binding domain for 
normal and neoplastic cells (monocytes, monocyte-like 
cells and A 431 epidermoid cells). The B chain or heavy 
chain (30000 daltons) consists of 253 amino acids and 
contains the catalytic domain. 

This molecular form of u-PA is generally termed 
urokinase (UK), two . chain urokinase (TC-UK) or high 
molecular weight urokinase (HMW- UK) (Gunzler, WA 
Steffens, G;J., Getting, F. , Buze, g. , and Flohe 
Hoppe-Seyler's Z. Phisiol. Chem. 363, p. 133, 1982) 
The second form of u-PA has a molecular weight of 33000 
daltons and results from proteolytic degradation of the 
HMW form by either plasmin or trypsin, it is called low 
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molecular weight urokinase (LMw-uk). Protein sequence 
determinations have revealed that LMW-UK is identical 
to HMW-UK except for the absence of the NH2- terminal 
135 amino acids that are specifically removed by the 
action of plasrain or trypsin (Steffens, Gunzler, 
W.A. Getting, F., Frankus, E. , and Flohe, L., 
Hoppe-Seyler's Z. Phisiol. Chem. , 363, p. 1043, 1982). 
Native prourokinase (proUK) is a single chain (54000 
daltons) form of urokinase and is also termed single 
chain urokinase type plasminogen activator (scu-PA). As 
stated before, proUK displays a fibrin-specific 
thrombolytic activity and is therefore a better 
thrombolytic agent compared to the presently used high 
or low molecular weight urokinases. 

In order to produce prourokinase, the authors of the 
present invention have developed a recombinant DNA 
procedure which allows the preparation of large amoiints 
of the proUK polypeptidic chain. 

Several methods have been described in the scientific 
and patent literature for the production of proUK 
(Holmes, W.E., Pennica, D. , Blaber, M. , Rey, M.W., 
Gunzler, W.A., Steffens, G.j. and Heynecker, H.L,; 
Biotechnology, 3 , p. 923, 1985; European Patent 
Application 0092182) • However, the method described 
v/ithin the text of the present invention exploits 
parameters known to be important for the expression of 
heterologous proteins in E. coli , but whose combination 
has never been applied before to the production of 
recombinant proUK. 

The main parameters , v/hose combination contributes to 
the establishment of a recombinant strain of E. coli , 
able to produce proUK, and which represents the object 
of the present invention, are the E, coli promoter 
Ptrp, the Shine -Dalgarno sequence MS -2 from the phage 
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MS-2, and coli strains of the type B as hosts for 
the expression of the human proUK gene (see below). 
Such combination is crucial. Substitution of one of 
these parameters with an alternative expression signal 
may not yield as much proUK. 

Accordingly, object of the present invention is a 
method for the preparation of non glycosilated pro-UK, 
characterized in that non-glycosilated pro-UK is 
expressed under the control of the E . coli promoter 
Ptrp and the Shine-Dalgarno sequence MS-2 by E. coli B. 

Description of the production procedure 

The present invention relates to the construction, by 

genetic engineering techniques, of strains of e. coli 

able to express the human proUK gene at high levels. 

Consequently, these recombinant strains are able to 

synthesize large amounts of the. proUK polypeptidic 

chain. 

In order to isolate said recombinant strains of the 
bacterium E. coli. it is necessary to go through a 
number of steps including: 

the isolation of the human cDNA gene coding for 
the desired proUK 

the insertion of said gene in an appropriate 
expression plasmid 

the transformation of selected strain of e . coli 
with the engineered plasmid and the cultivation of 
the transformants in appropriate conditions. 

1) Cloning of the human cDNA gene coding for p rnTTif 
To obtain the cDNA clone coding for hvunan proxirokinase. 
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the authors have utilized the proteins sequence data 
published in the literature (Gunzler, W.A., Steffens, 
G.J., Getting, F., Kim, SM.A. , Frankus. E. , and Plohe, 
L.: Hoppe - Seyler's Z. Physiol. Chem. , 363, p. 1155, 
1982; Gunzler, W.A., Steffens, G.J., Getting, F. , Buze, 
G., and Flohe, L. : Hoppe-Seyler • s Z. Phisiol. Chem. 
363, p. 133, 1982; Steffens, G.J., Gunzler, W.A. , 
Getting, F. , Frankus, E. , and Flohe, L., Hoppe-Seyler • s 
Z. Phisiol. Chem., 363, p. 1043, 1982). 
Accordingly, specific probes have been prepared and an 
appropriate cDNA library has been screened. 
Oligonucleotides coding for selected peptides of 
single-chain urokinase-type plasminogen activator 
(pro-UK) were chemically synthesized (Caruthers, M.H. , 
Gassen, H.G. and Lang, J.A. (eds) Verlag-chimie , 
Weinheim, Deefield. Beach, Basel, p. 71, 1982) to serve 
as specific probes to monitor enrichment of proUK mRNA 
and to select for clones containing prourokinase cDNA 
from an enriched cDNA library. The oligomers were 14 
to 17 mer in lenght, and each one was synthetized 
either as unique sequence (named p7) or in pools 
containing two (named pi, p2, p3) or 16 (named p6) 
oligonucleotides as indicated in Fig. 1. The oligomers 
were tested for specificity to proUK by northern 
hybridization. For this analysis polyA-containing RNA 
was extracted from the HEp-3 epidermoid carcinoma 
(Miskin, R. , Haeraostasis (Switzerland), 11, No. suppl 
1, p. 63, 1982). For each oligomer the temperature of 
the washing following the hybridization reaction was 
adjusted so as to be 2 to 5 degree C below the minimal 
melting temperature, as calculated according to Suggs 
et al. for hybridization to southern blots (Suggs, 
S.V., Hirose, T. , Miyake, T. , Kawashima, E.G., Johnson 
M.Y., Itakura, K. and Wallach, R.B., Developmental 
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Biology Using Pxirified Genes; Brovm D.D. and Fox, C.F. 
(eds) Academic Press, New York, p. 638, 1981). in this 
text the five proUK probes, shown in Fig. i, reacted 
with one common major carcinoma mRNA band of about 2.3 
kb, which is the size expected for proUK mRNA. 
Cloning took place using enriched mRNA fractions from 
the HEp-3 epidermoid carcinoma. RNA preparations were 
extracted and enriched about 3 fold on two successive 
sucrose gradients. cDNA was synthesized according to 
published procedures ( Ef stratiadis , A., Kafatos. F c 
Maxam, A.M. and Maniatis, T. Cell, 7, p. 279, 1976; 
Buell, G.N., Wickens, M.P. , Payvar, F. and s'chimke,' 
R.T. J. Biol. Chem. 253, p. 2471, 1978) using oligo-dT 
as a primer. Longer molecules were isolated using 
polyacrilamide gel electrophoresis followed by 
electro-elution of the appropriate gel fractions. The 
CDNA was then extracted using standard 
phenol/chloroform extraction followed by ethanol 
precipitation. 

These CDNA molecules were first ligated to EcoRl 
linkers and then cloned into the phage XgtlO vector 
according to a modification of the technique of Davis 
(Maniatis, T., Fritsch, E.F. , Sambrook, j. : Molecular 
cloning. A laboratory manual. Cold Spring Harbour 
Laboratory. Cold Spring Harbour, NY, 1982). By doing 
so, a library containing 2x10- pfu (plague forming 
units) was constructed. 

Half of the library was screened on duplicate filters, 
one filter with labelled probe pi, and the 

counterpart filter with a mixture of probes p3 and p6. 
A total of 36 positive clones were obtained, seven of 
which were positive in the duplicate filters, thus 
indicating cDNA inserts corresponding to a large 
portion of the proUK coding sequence. 

Recombinant phages that hybridized with the 3 probes 
were plagues purified using probe pi, and further 
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characterized by restriction mapping with EcoRI and by 
DNA sequencing. The fraction of the positive clones in 
the total cDNA library indicated that the frequency of 
prourokinase mRNA in the HEp-3 carcinoma is 
approximately 0.01%. 

Sequence analysis of four proUK cDNA clones revealed 
that three of the clones had deletions or sequences not 
consistent with the amino acid sequence of the enzyme. 
Only one clone, Aucl7, had a sequence with complete 
concordance with the known amino acid sequence. 
However , A Uc 17 did not include the entire 3' non-coding 
end of the mRNA and was missing 30 nucleotides of the 
coding sequence. A full lenght pre-prourokinase cDNA 
clone was constructed by ligation of a 1325 bp ^Ucl7 
Smal-BamHI fragment, containing the 5' non-coding 
region and the majority of the coding sequence, with a 
BamHI -EcoRI fragment containing the remaining missing 
3' region from another clone A Uc 6 (Fig. 2). 
This construct was ligated into the Smal-EcoRI site of 
the plasmid vector pUN121 (Nilsson B., Uhlen M., 
Josephson S., Gatenbeck S. and Philipson L. , Nucleic 
Acid Research 11, p. 8019, 1983), eliminating, thus, 
most of the cl gene, and given the name pcUK176 (Fig. 
3). 

The DNA sequence of the complete cDNA clone is depicted 
in Fig. 4. It consists of 2296 nucleotides, including 
69 non-coding nucleotides at the 5' end, 1296 coding 
nucleotides, and 931 non-coding nucleotides at the 3' 
end, followed by a poly (A) tail of more than 80 
residues . 

The coding sequence starts with 60 bp coding for 20 
amino acids comprising the "pre-prourokinase" 
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(Heyneker, H.L. , Homes, W.E. and Vehar, G.A. (1983) 
European Patent Application Publ. No. 0092182), and is 
followed by the sequence coding for the entire 
prourokinase protein, which is in complete concordance 
With the amino acid sequence. 

The complete fragment has been checked by sequence and 
restriction analysis. The sequence coding for mature 
prourokinase has been inserted into the expression 
vector used for production. 

Construction of the nrn UK expre5..^<nn olasmiH 
The original full lenght cDNA, present in pcUK176, was 
used to construct a prourokinase expression plasmid, 
named pFC44, in which the proUK gene is under the 
transcr_ptional and the translational control of the 
promoter Ptrp and of the "Shine-Dalgarno" sequence 
MS-2, respectively. The plasmid pPC44 is shown in fig. 

in order to obtain pFC44, several intermediate plasmids 
were constructed. Starting with pDS20 (Fig. 5)(Duester 
G., Helfard, R.m. and Holmes, W.M. : cell 30, p. 355' 
1982), we have first replaced the EcoRl-Hindlli 
fragment coding for the galactose operon promoter Pgal 
with the EcoRl-Hindlll polylinker sequence from the MI3 
mpS vector (Vieira, J. and Messing, i. Gene, 19, p. 

259, 1982), obtaining a new plasmid, named pABl ('pig. 
5) • 

The promoter Ptrp has been obtained from the plasmid 
PDR720 (bought from Pharmacia) as an EcoRI-Sall 
restriction fragment. This fragment has been inserted 
xn the polylinker region of pABl between the EcoRI and 
the sail site. By doing so, we have obtained a new 
plasmid, named pFClO (Fig. 5). 

PFCIO can be considred as the base vector into which we 
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have inserted the proUK gene as well as the 
"Shxne-Dalgarno" sequence from the phage MS- 2. 
To achieve expression of mature protirokinase , it is 
necessary to fuse the proUK coding sequence, from the 
first codon of the mature protein to the initiator 
triplet ATG, This fusion must then be preceded by the 
"Shine-Dalgarno" sequence. 

The ribosome binding site (RBS) from the bacterial 
phage MS-2 was known and its nucleotide sequence had 
already been published (Fiers, w. , Contreras, R., 
Duerinck, P., Haegeman, G. , Iserentant, D. , Merregaert,' 
J., Min Jou, W., Molemans, F. , Raeymaekers, A., Van den 
Berghe, A., Volckaert, G. and Ysebaert, M. Nature, 260, 
p. 500, 1976). ' ' 

It is thought to be a strong signal for an efficient 
translation of the mRNA. Therefore, we have chosen this 
region as translation signal for the production of 
proUK. In order to obtain the correct nucleotide fusion 
with the proUK gene, we have synthesized a double 
strand DNA region of the MS-2 RBS directly joined to 
the beginning of the proUK gene. A TaqI site is present 
on the 25th nucleotide of the mature proUK sequence. We 
have taken advantage of this site and isolated, by 
chemical synthesis, the following DNA sequence: 

Hindi II 

Metser 

5 • -AGCTTTAATAGACGCCGGCCATTCAAACATGAGGATTACCCATGAGCA 
3 ' -AATTATCTGCGGCCGGTAAGTTTGTACTCCTAATGGGTACTCGT 

TaqI 

ATGAACTTCATCAAGTTCCAT-3 ' 
TACTTGAAGTAGTTCAAGGTARr.- «; ' 



is flanked upstream by an Hindlli site and down 
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stream by a TaqI site. The initiator codon ATG is shown 
in bold face. The sequence coding for the beginning of 
the mature proUK sequence is underlined. 
The synthetic fragment has been used in a ligation 
reaction with the two following restriction fragments: 
-the Tagl-Bglll fragment from pcUK176 (Fig. 3), which 
carries the proUK sequence from nucleotide 155 to 
nucleotide 392 (see Fig. 4) ; 

-the large BamHI-Hindlll fragment from pFClO (Fig. 5), 
which carries the antibiotic resistance to ampicillin 
as well as the promoter Ptrp. 

Through this construction, we have isolated a new 
plasraid, named pAB8, whose schematic map is shown in 
fig. 6. In this plasmid, the promoter Ptrp and the MS- 2 
RBS are fused to the first 260 nucleotides of the 
mature proUK gene (corresponding to nucleotides 131-391 
in Fig. 4). In addition, pAB8 has a unique Ncol site 
into which we have inserted the rest of the proUK 
sequence through a Ncol -Ncol restriction fragment from 
PCUK176. This ligation has caused the duplication of an 
Ncol-Bglll fragment downstream of the proUK gene in the 
non-coding region. However, this duplication does not 
affect plasraid stability. Through this construction 
signals can now direct the synthesis of the complete 
proUK sequence (see Fig. 6). 

All the plasmids, described so far, were selected in 
the E . coli K-12 host strain C-600 galK (ATCC 33955), 
on the basis of ampicillin resistance. Indeed, they 
carry the gene coding for 6 -lactamase, the enzyme 
responsible for the degradation of the antibiotic 
ampicillin in the culture medium. Early experiments 
have shown that pFC16 could he successfully inserted in 
^- type B strains and cause high level production 

of recombinant proUK- 



wo 90/04023 



PCr/EP89/01168 



14 



However, to comply with international guidelines for 
the production of recombinant DNA-derived products, we 
have modified pFC16 to create a new 
tetracycline-resistant plasraid able to express the 
proUK gene at high levels. In particular, from the 
well-known plasmid pBR322 (Maniatis, T. / Fritsch, E.F. 
Sambrook, J.: Molecular cloning. A laboratory manual. 
Cold Spring Harbour Laboratory. Cold Spring Harbour, 
NY. 1982) (Fig. 6), we have isolated a EcoRI-Aval 
fragment where the sticky ends were filled in using the 
klenow fragment of DNA polymerase I {Perbal,B., A 
Wiley-Interscience Pviblication John Wiley and Sons, p. 
231, 1984). This fragment was ligated to the larger 
Aatll-Pvul fragment from pFC16, whose ends were made 
blunt by DNA polymerase I (Perbal, B., A 
Wiley-Interscience. Publication John Wiley And Sons, p. 
231, 1984). By doing so, we have replaced the amino 
terminal portion of the fl-lactamase gene and its 
controlling sequence with the tetracycline-resistance 
gene. Following ligation the tetracycline resistance 
gene is -in the same orientation as the proUK gene. 
Moreover, a new EcoRl site has been created at the 
jxmction between the Pvul and EcoRI sites, previously 
filled in. The new plasmid, pFC44 (see fig. 7), is the 
final construction that has been used for the 
production of recombinant prourokinase . 
Plasmid pFC44 (tetracycline-resistant) and pFC16 
(ampicillin resistant) are one of the objects of the 
present invention. The expression signals, present in 
these two plasmids, namely the promoter Ptrp and the 
Shine -Dalgarno sequence "MS-2", have already been 
described" in the literature for the expression of 
heterologous proteins ( Remaut E . , Stranssens P . and 
Fiers W. Nucl. Acid. Res. 11, p. 4677, 1983). However, 
their combination has never been applied before to the 
expression of the proUK gene . 
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3) Transfor mation of E.coll tvoe B -strains 
The second main object of the present invention is the 
use of E. coli strains of the type B for the expression 
and production of prourokinase . Indeed, the authors of 
the present invention have found that insertion of 
Plasraids PFC16 or pFC44 in type B strain of the 
bacterium E. coli brings to high level productions of 
the proUK polypeptidic chain. Interestingly, insertion 
of plasmids PFC16 or pFC44 in other strains of e. coli 
(type K-12, type C, type W, etc) does not yield as much 
proUK. consequently, the host strain type seems to be 
crucial for the successful production of proUK. 
Several type B strains of E. coli are available and can 
be used for successful expression of the proUK gene. 
Prefered strains are: ATCC 12407, ATCC 11303, NCTC 
10537 . Below is an example of transformation of strain 
NCTC 10537 with plasmid pFC44 and subsequent 
cultivation of the transformant. 

Competent cells of strain NCTC 10537 were prepared 
using the calcium chloride procedure of Mandel and Higa 
(Mandel, M. and Higa, A., j. moI. Biol. 53, p. 154 
1970). Approximately 200 >tl of a preparation of these 
cells at 1 X 10» cells per milliliter were transformed 
with 2>i.l of plasmid DNA (approximate concentration 5 
>lg/ral). Transformants were selected on plates of L-agar 
containing 12.5 ^/ml tetracycline. Two small colonies 
were streaked with wooden tooth picks (each as three 
streaks about 1 cm long) onto L-agar containing the 
same antibiotic. After 12 hours incubation at 370c, 
portions of the streaks were tested for human 
prourokinase production by inoculation onto 10 ml of i£ 
medium (containing tetracycline at a concentration of 
2.5 ;^g/ml) and incubated overnight at 37«»c. The 
following day the cultures were diluted 1:100 in M9 
medium, containing the same concentration of 
tetracycline, and incubated for 6 hours at 37 "C. Total 
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cell proteins from 250 }Kl aliguots of cultxire (0*D. = 
1-1.5) were analysed by sodium dodecylsulf ate 
polyacrylamide gel electrophoresis as described by 
Laeramli (Laeramli, U.K. Nature, 227, p. 680, 1970). a 
major protein band having a molecular weight 
corresponding to that of non-glycosilated human 
prourokinase (45000 daltons) was observed for the two 
independent samples (Fig. 8). 

The set of streaks corresponding to colony no. 2 (clone 
2) was chosen arbitrarily for further characterization 
and then selected as a proUK producing strain. 

Materials and Methods 

Growth Media ; The media used were prepared using 
recipes as described by Maniatis etal. (Maniatis T 
Fritsch, E.F. Sambrook, J.: Molecular cloning. a 
laboratory manual. Cold Spring Harboiir Laboratory. Cold 
Spring Harbour, NY 1982). LB Mediiim, LB agar and 
MacConkey agar were prepared using Difcobacto Products. 
M9 medium is comprised of the following components: 
Na^HPO^, 6 grams/1; KH^PO^, 3 g/1: NaCl, 0.5 g/1; 
NH^Cl, 1 g/1. After sterilization of the above 
components by autoclaving (1 atm. ; 120 °C for 20 min* ) , 
1 ml of IM MgSO^, 0.1 ml of IM CaCla, 16 ml of 25% 
Glucose, 20 ml of 0.5 mg/ml Thiamine (SIGMA) and 20 ml 
of 20% Casamino Acids (DIFCO) were added per litre. The 
above solution is sterilized by filtration. 

Use o f Restriction Endonucleases and Other Enzymes r 
Restriction endonucleases, T4 DNA Ligase and DNA 
Polymerase I (Klenow Fragment) were obtained from New 
England Biolabs and from Boehringer Mannheim and were 
used in conditions recomended by the manufacturer. 

Preparat ion of Plasmid DNA ; Plasmid DNA was prepared by 
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a method involving dye-buoyant density centrifugation 
based on the procedure of Birnboim and Doly (Birnboim, 
H.C. and Doly. J., Nucleic Acid Res. 7, p. I5i3, 1979)' 

DNA sequence analysis ; Sequence data were obtained 
using the Araersham Ml 3 sequencing kit according to the 
instruction of the manufacturer. Briefly, this 
technique, based on the Sanger method (Sanger, F., 
science, 214, P. 1205, 1981) consists of subcloning 
various restriction fragments in convenient M13 vectors 
(the "mp family") which can be obtained in their single 
strand configuration. After annealing the single strand 
forms with "universal primers", it is possible to copy 
the DNA template with DNA polymerase I (klenow 
fragment). By copying the template in the presence of 
the four dideoxynucleotides it is possible to cause 
random terminations of the chain elongation: The 
truncated fragments are then separated on denaturing 
polyaerylamide gels and the electrophoretic profile is 
evidenced by autoradiography. 

Oligonucleotides! Synthetic oliognucleotides , utilized 
in plasmid constructions and as primers in DNA 
sequencing, were synthesized using the Applied 
Biosystem (ABI) DNA synthesizer 380B according to the 
ABI manual. 



Other procedures : Procedures for agarose and 

polyacrilamide gel electrophoresis of nucleic acids 
were as described by Maniatis et al. (Maniatis, T 
Fritsch, E.F. Sarabrook, j.: Molecular cloning, a 
laboratory manual. Cold Spring Harbour Laboratory. Cold 
Spring Harbour, NY, 1982). Proteins were separated by 
polyaerylamide gel electrophoresis as described by 
Laemmli (Laemmli, U.K. Nature, 227, p. 680, 1970). DNA 
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was extracted from agarose gels by electrophoresis into 
dialysis bags and was concentrated by ethanol 
precipitation • 

Legends to Figures 

Fig. 1: The nucleotide sequence of probes PI, P2^ 

P6 and P7, the complementary mRNA sequence and the 

proUK peptides coded by the probes are depicted. 

Fig. 2: The size and location of restriction 
endonuq lease cleavage products were estimated by 
electrophoresis and confirmed by DNA sequence analysis. 
The filled region indicates the coding sequence of the 
mature proUK protein, the cross-hatched region 
represents the "pre -pro" peptide coding sequence and 
the open regions indicate the 5 ' and the 3 ' 
non- translated sequences. The 5' end of the mRNA is to 
the left. The lines below the restriction map indicate 
the contribution of the two partial clones Aucl7 and 
Auc5. 

Fig. 3: The EcoRI-Smal fragment carrying the cDNA 
clone has been inserted in pUN121 replacing most of the 
cl gene. Plasraid pcUK176 is still tetracycline and 
ampicillin resistant. *cl represents an inactivated cl 
protein* 

Fig. 4: The complete cDNA sequence of clone pcUK176 
is depicted with the corresponding translated amino 
acid sequence. Restriction sites which have been used 
for plasmid constructions are underlined. Two 
polyadenilation sites at position 2264 and 2277 and the 
Serine residue at position 1 of the mature proUK 
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sequence are aZso underlined. 

Fig. 5: In this figure four intermediate 
constructions are depicted. The starting plasmid, 
PDS20, carries the general background which, through 
the different intermediate plasmids, looses the 
promoter Pgal, the galK gene and the B-lactamase gene. 

Fig. 6: Three additional intermediate constructions 
including plasmid pFC16 which expresses high level of 
proUK. For details, see the text. 

Fig. 7: In PFC44 the mature proUK coding sequence is 
under control of the promoter Ptrp and of the "ribosome 
binding site" from the phage MS-2. The tetracycline 
gene has been inserted at the place of the B-lactamase 
gene. pFC44 is therefore ampicillin sensitive. 

Fig. 8: The samples were prepared and analysed as 
described in the text and loaded on a 12 5% 
SDS-polyacrylamide gel (acryl to bisacrylamide ratio 
40:1). Lanes 1 and 2 contain material derived from two 
cultures of strain NCTC 10537 transformed with pFC44. 
The position of the recombinant prourokinase protein is 
indicated by the arrow. Lane 3 shows material derived 
from the control host strain NCTC 10537. A molecular 
weight standard is shown in lane 4. 

Discussion and conclusions 

The present invention relates to a recombinant DNA 
method for the production of non-glycosilated 
prourokinase. This method is based on the insertion of 
the human gene coding for proUK in bacterial strains of 
E; coli and the subsequent cultivation of said 
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transformed strains. 

The production of heterologous proteins in E. coli is a 
well studied field of modern biotechnology (Harris 
T.J.R. and Eintage J,S« Microbiological Sciences, 3, p. 
28-31, 1986). Today the molecular biologists dispose of 
several expression signals such as promoters. 
Shine -Dalgar no sequences, terminators, etc. that can be 
used for the protein of choice. The promoter is 
responsible for the synthesis of messenger RNA while 
the Shine-Dalgarno sequence should guarantee an 
efficient translation of the mRNA in polypeptidic 
chain. 

The combination, however, of these parameters is an 
important feature in the heterologous gene expression. 
For example, fusion of an efficient Shine-Dalgarno 
sequence to different promoter regions can lead to 
different expression levels. In addition, the lenght of 
the restriction fragments carrying the expression 
signals often affects the levels of production 
(McCarthy J.E.G., Sebald W. , Gross G. and Lammers R. ; 
Gene, 41, p. 201, 1986). 

The choice of the host strain is also a critical step 
in the development of an efficient method of 
production. It is, in fact, known that insertion of the 
same expression plasmid in different strains can lead 
to very different expression efficiencies (Harris 
T.J.R. and Emtage J.S. Microbiological Sciences, 3, p, 
28-31, 1986). ~' 
While the expression signals described in the present 
invention were already known in the scientific 
literature, their combination had never been exploited 
before for the specific expression of human 
prour ©kinase • More particularly, plasmids pFC16 and 
PFC44 carry the proUK gene under control of the E. coli 
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promoter Ptrp and the phage Shine-Dalgarno sequence 
MS-2. 

consequently, the production method, disclosed within 
the text of the present invention, is based on 
expression plasmids essentially different from other 
expression plasmids previously described. These 
plasmids, PFC16 and pFC44, represent therefore one of 
the novelty aspects of the present invention and, as 
already said, are an object thereof. 

In addition, the method here disclosed takes advantage 
of E. coli strains of the type B. The vast majority of 
the expression methods, described in the scientific 
literature, is based on strains of E. coli of the type 
K-12. Thus, the production of proUK in E^_coli strains 
of the type B represents another novelty aspect of the 
present invention.. 

This second aspect is extremely important. The choice 
of the host organism can, in fact, affect the global 
production process at several steps. 

For instance, fermentations at high biomass may 
dramatically be influenced by the type of host. The 
present inventors as well as other groups of 
researchers have consistently found that e. coli 
strains of the type B can be grown more easily than, 
e.g., K-12 strains. Insertion of the same expression 
Plasmids, PFC16 or pFC44, in K-12 strains such as C600 
generates recombinant strains, which cannot grow in 
fermentators, as efficiently as the recombinant b 
strains. m other words, yields of recombinant 
non-glycosilated pro-UK are higher from B strains, when 
using the same expression plasmids. 

Another important feature related to the choice of the 
host strain is the different nature of the bacterial 
contaminants during the pro-UK production process. 
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Undesired contaminants, such as proteases, could 
severely affect yields of the recombinant product ♦ 
Interestingly, in 1986, Winkler and Blaber 
(Winkler, M.E. , Blaber, M. : Biochemistry, 25, n. 14, p. 
4041, 1986) have described a pro-UK production process 
based on the K-12 strain 294 (ATCC 31446). 
In this process, the authors had to take several 
precautions to avoid proteolytic digestion of pro-UK, 
According to the authors these proteolytic activity was 
due to bacterial proteases from the host strain. 
In contrast the use of B strains according to the 
present invention yields cell extracts with much lower 
proteolytic activity. In particular, it has been found 
that pro -UK extracted from the K-12 strain C600 is 
contaminated by urokinase to a higher extent compared 
to pro-UK from B strains. 

In conclusion, the authors of the present invention, 
believe that the higher yields of recombinant pro-UK 
observed with the here described procedure compared 
with the prior art, represent an unpredictable result 
and an improvement over the knovm procedures. 
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Claims 



1. 



A metood for the preparation of non-gXyoosllated 

orr^', n=n-»lycosilated 
pro-UK Is expressed under the control of the E 

mi promoter Ptrp and the Shine-Oalgarno se™ 

by E. coll 

A method according to claim i wherein the 
non-glycosilated single chain prourokinase has a 
molecular weight of about 45000 daltons. 

The method according to claim 1, characterized in 
that the E. coli B expresses primarily the 
sequence of pro-UK. 

The method according to claim 1, characterized in 
that the cDNA sequence for pro-UK is obtained from 
inRNA of HEp-3 epidermoid carcinoma cells. 

Wie method according to claim 1. characterized in 
that the promoter Ptrp constitutes of an 
EcoRl-sall restriction fragment obtained from the 
plasmid pDR-720. 

The method according to claim 1, characterized in 
that the sequence comprising the Shine -Dalgarno 
sequence MS-2, the ATG start codon and the 
beginning of the pro-UK gene, flanked upstream by 
a Hxndlll site and downstream by a Tagi site is as 
follows : 
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Hindlll 

5 ' -AGCTTTAATAGACGCCGGCCATTCAAACATGAGGATTACCCATGAGC 
3 ' -AATTATCTGCGGCCGGTAAGTTTGTACTCCTAATGGGTACTCG 

TagI 

AATGAACTTCATCAAGTTCCAT-3 » 
TTACTTGAAGTAGTTCAAGGTAGC-5 ' 

?• Expression plasmid FC-16 according to Fig. 6, 
8. Expression plasmid FC-44 according to Fig. 7. 



1 
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FIG 

1 Smal CQ 
CCCGGGCTCCGGGCTGCGGTCKCTGCCGCAGCCACCGAGCCGCCGTCTAGCGCCCCGACCTC 

100 

GCCACC ATG AGA GCC CTC CTQ GCG CCC CTG CTT CTC TGC GTC CTB GTC 
Met Arg Ala Leu Leu Ala Arg Leu Leu Leu Cys Val Leu Val 

150 Taql 

GTG AGC GAC TCC AAA GGC A6C AAT GAA CTT CAT CAA GTT CCA TGC A AC 
Val Ser Asp Ser Lys Gly Ser Asn Glu Leu His Gin Val Pro Ser Asn 

T6T GAC TGT CTA AAT GGA GGA ACA TGT GTG TCC AAC AAG TAC TTC TCC 
Cys Asp Cys Leu Asn Gly Gly Thr Cys Val Ser Asn Lys Tyr Phe Ser 

AAC AAT CAC TGG TGC AAC TGC CCA AAG AAA TTC GGA G6G CAG CAc'^TGT 
Asn He His Trp Cys Asn Cys Pro Lys Lys Phe Gly Gly Gin His Cys 

GAA ATA GAT AAG TCA AAA ACC TGC TAT GAG G6G AAT GGT CAC TTT TAC 

Glu lie Asp Lys Ser Lys Thr Cys Tyr Glu Gly Asn Gly His Phe Tyr 

Ncol 350 

CGA GGA AAG GCC ACC ACT GAC A CC ATG G GC CGG CCC TGC CTG CCC TGG 

Arg Gly Lys Ala Ser Thr Asp Thr Met Gly Arg Pro Cys Leu Pro Trp 

AAC TCT GCC ACT GTC CTT CAG CAA ACG TAC CAT GCC CAC AGA^TCT GAT 
Asn Ser Ala Thr Val Leu Gin Gin Thr Tyr His Ala His Arg Ser Asp 

GCT CTT CAG CTG GGC CTC GGG AAA CAT AAT TAC TGC ACG AAC CCA GAC 

Ala Leu Gin Leu Gly Leu Gly Lys His Asn Tyr Cys Arg Asn Pro Asp 
450 

AAC CGG AGO CGA CCC TGG TGC TAT GTG CAG GTG GGC CTA AAG CCC CTT 

Asn Arg Arg Arg Pro Trp Cys Tyr Val Gin Val Gly Leu Lys Pro Leu 
500 

GTC CAA GAG TGC ATG GTG CAT GAC TGC CCA GAT GGA AAA AAG CCC TCC 

Val Gin Glu Cys Met Val His Asp Cys Ala Asp Gly Lys Lys Pro Ser 
550 

TCT CCT CCA GAA GAA TTA AAA TTT CAG TGT GGC CAA AAG ACT CTG AGG 

Ser Pro Pro Glu Glu Leu Lys Phe Gin Cys Gly Gin Lys Thr Leu Arg 

600 

CCC CCC TTT AAG ATT ATT GGG GGA GAA TTC ACC ACC ATC GAG AAC CAG 

Pro Arg Phe Lys He He Gly Gly Glu Phe Thr Thr He Glu Asn Gin 

650 

CCC TGG TTT GCG GCC ATC TAC AGG AGG CAC CGG GGG GGC TCT GTC ACC 

Pro Trp Phe Ala Ala H e Tyr Arg Arg His Arg Gly Gly Ser Val Thr 
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FIG.4 cont'd 

700 

TAC CTG TOT GGA GGC AGC CTC ATC AGC CCT TGC TGG GTG ATC AGC GCC 
Tyr Val Cys Gly Gly Ser Leu He Ser Pro Cys Trp Val He Ser Ala 

750 

ACA CAC TGC TTC ATT GAT TAC CCA AAG AAG GAG GAC TAC ATC GTC TAC 
Thr His Cys Phe He Asp Tyr Pro Lys Lys Glu Asp Tyr He Val Tyr 

800 

CTG 6GT CGC TCA AGG CTT AAC TCC AAC ACG CAA GGG GAG ATG AAG TTT 
Leu Gly Arg Ser Arg Leu Asn Ser Asn Thr Gin Gly Glu Met Lys Phe 

850 

GAG GTG GAA AAC CTC ATC CTA CAC AAG GAC TAC AGC GCT GAC ACG CTT 
Glu Val Glu Asn Leu He Leu His Lys Asp Tyr Ser Ala Asp Thr Leu 

900 

GCT CAC CAC AAC GAC ATT GCC TTG CTG AAG ATC CGT TCC AAG GAG GGC 
Ala His His Asn Asp He Ala Leu Leu Lys He Arg Ser Lys Glu Gly 

950 

AGG TGT GCG CAG CCA TCC CGG ACT ATA CAG ACC ATC TGC CTG CCC TCG 
Arg Cys Ala Gin Pro Ser Arg Thr He Gin Thr He Cys Leu Pro Ser 

1000 

ATG TAT AAC GAT CCC CAG TTT GGC ACA AGC TGT GAG ATC ACT GGC TTT 
Met Tyr Asn Asp Pro Gin Phe Gly Thr Ser Cys Glu He Thr Gly Phe 

1050 

GGA AAA GAG AAT TCT ACC GAC TAT CTC TAT CCG GAG CAG CTG AAA ATG 
Gly Lys Glu Asn Ser Thr Asp Tyr Leu Tyr Pro Glu Gin Leu Lys Met 

1100 

ACT GTT CTG AAG CTG ATT TCC CAC CGG GAG TGT CAG CAG CCC CAC TAC 
Thr Val Val Lys Leu He Ser His Arg Glu Cys Gin Gin Pro His Tyr 

TAC GGC TCT CAA GTC ACC ACC AAA ATG CTA TGT GCT GCT GAC CCA CAA 
Tyr Gly Ser Glu Val Thr Thr Lys Met Leu Cys Ala Ala Asp Pro Gin 

TGG AAA ACA GAT TCC TGC CAG GGA GAC TCA GCG GGA CCC CTC GTC TCT 
Trp Lys Thr Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys 

TCC CTC CAA GGC CGC ATG ACT TTG ACT GGA ATT GTG^GC TGG GGC CGT 
Ser Leu Gin Gly Arg Met Thr Leu Thr Gly He Val Ser Trp Gly Arg 

GGA TGT GCC CTG AAG GAC AAG CCA GGC GTC TAC ACG AGA GTC TCA CAC 
Gly Cys Ala Leu Lys Asp Lys Pro Gly Val Tyr Thr Arg Val Ser His 

TTC TTA CCC T GG ATC C GC ACT CAC ACC AAG GAA GAG AAT GGC CTG GCC 
Phe Leu Pro Trp He Arg Ser His Thr Lys Glu Glu Asn Gly Leu Ala 



SUBSTITUTE SHEET 



wo 90/04023 



PCT/EP89/01168 



6/9 



CATGGGTGAAGGGAGADCCAGCTCCCCCGACGGTGGGCAnTCTCAGCI 



FIG. 4 

cont'd 

CTC 154 ffiGTCCCCAGCGABCAMCCCCCACCACCCCCnTCTTGCTGGTTQTCATTrTT 

HAGrAGAGTCArCTCCATCAOCTOTAAIiAAMlACTCCfiAAGATAQliCTCTBCACABATCGA 

*1 ^ 0 0 

TTTGCCTGTGCCACCCACCAGGCTGAACGACAATAGCTTTACCCTCAGGCATAGGCCTGGGTG 
1550 

CTGGCTGCC[A0ACCCCTCTGGCCAG6ATGGAGGGGTGGTCCTGACTCAACATGT&^ 

AGCAACTTGTCTTTTTCTGGACT6AAGCCT6CAGGAGTTAAAMGG6CAGGGCATCTCCTGTG 

Ncol 
C CCATGG TTGAGAAA 

TGAATAATTTCCCAATTAGGAAGTCTAACAGCTGACGTCTCTTGAGGGAGCTTAGCCAATGTG 
1600 

GGAOCAGCGGnTGCGGGAGCAGAGACACTAACGACTTCAGGGCAGGCCTCTGATATTCCATG 

1900 

AATGTATCAGGAAATATATATGTGTGTGTATGTTTGCACACTTGTGTGTGGGCTGTGAGTGTA 

1950 

AGTGTGAGTAAGAGCTGGTGTCTCATTGTTAAGTCTAAATATTTCCTTAAACTGTGTGGACTG 
2000 

TGATGCCACACAGAGTGGTCTTTCTGGAGAGGTTATA6GTCACTCCTGGGGCCTCTTGGGTC? 

CCCACGTQACAGTGCCTGGGAATGTATTATTCTGCAGCATWCCTGTGACCAGCACTGTCTCA 
GTTTCACTTTCACATAGATGTCCCTTTCTTGGCCAGTTATCCCTTCCTTTTAQCCTAGTTCAT 

CCAATCCTCACTGGGTGGQGTGAGGACCACTCCTTACACTGAATATTTATATTTCACTATTTT 
2250 

TATTTATATTTTTGTAATTTTAAATAAMGTGATCAA^ 
C 
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FIG. 5 
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Pvul 



FIG. 6 
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FIG .7 
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